Search CORE

1,084 research outputs found

Recommended from our members

Systematic identification and correction of spelling errors in the Foundational Model of Anatomy

Author: Gooch P.
Publication venue
Publication date: 01/01/2011
Field of study

We describe a method for automating the detection and correction of spelling errors in the Foundational Model of Anatomy (FMA). The FMA was tokenized into 4893 distinct words; misspellings were identified and corrected using the National Library of Medicine’s SPECIALIST GSpell Spelling Suggestion API. We identified 43 errors occurring in 97 terms, and 6 words of questionable or inconsistent spelling occurring in 26 terms. These errors are replicated in other reference terminologies that use the FMA. Our approach may be useful as part of a quality assurance process for other large-scale biomedical knowledge resources

City Research Online

Recommended from our members

A lightweight, pattern-based approach to identification and formalisation of TimeML expressions in clinical narratives

Author: Gooch P.
Publication venue
Publication date: 01/01/2012
Field of study

General Architecture for Text Engineering (GATE) components for identifying clinical events and temporal expressions are developed and evaluated against a corpus of 120 discharge summaries

City Research Online

Recommended from our members

BADREX: In situ expansion and coreference of biomedical abbreviations using dynamic regular expressions

Author: Gooch P.
Publication venue: City University London
Publication date
Field of study

BADREX uses dynamically generated regular expressions to annotate term definition–term abbreviation pairs, and corefers unpaired acronyms and abbreviations back to their initial definition in the text. Against the Medstract corpus BADREX achieves precision and recall of 98% and 97%, and against a much larger corpus, 90% and 85%, respectively. BADREX yields improved performance over previous approaches, requires no training data and allows runtime customisation of its input parameters. BADREX is freely available from https://github.com/philgooch/BADREX-Biomedical-Abbreviation- Expander as a plugin for the General Architecture for Text Engineering (GATE) framework and is licensed under the GPLv3

City Research Online

Recommended from our members

Coreference resolution in clinical discharge summaries, progress notes, surgical and pathology reports: a unified lexical approach

Author: Gooch P.
Roudsari A.
Publication venue
Publication date: 01/01/2011
Field of study

We developed a lexical rule-based system that uses a unified approach to resolving coreference across a wide variety of clinical records comprising discharge summaries, progress notes, pathology, radiology and surgical reports from two corpora (Ontology Development and Information Extraction (ODIE) and i2b2/VA) provided for the fifth i2b2/VA shared task. Taking the unweighted mean between 4 coreference metrics, validation of the system against the i2b2/VA corpus attained an overall F-score of 87.7% across all mention classes, with a maximum of 93.1% for coreference of persons, and a minimum of 77.2% for coreference of tests. For the ODIE corpus the overall F-score across all mention classes was 79.4%, with a maximum of 82.0% for coreference of persons and a minimum of 13.1% for coreference of diagnostic reagents. For the ODIE corpus our results are comparable to the mean reported inter-annotator agreement with the gold standard. We discuss the four categories of errors we identified, and how these might be addressed. The system uses a number of reusable modules and techniques that may be of benefit to the research community

City Research Online

Recommended from our members

Computerization of workflows, guidelines and care pathways: a review of implementation challenges for process-oriented health information systems

Author: Gooch P.
Roudsari A.
Publication venue: 'BMJ'
Publication date: 01/01/2011
Field of study

There is a need to integrate the various theoretical frameworks and formalisms for modeling clinical guidelines, workflows, and pathways, in order to move beyond providing support for individual clinical decisions and toward the provision of process-oriented, patient-centered, health information systems (HIS). In this review, we analyze the challenges in developing process-oriented HIS that formally model guidelines, workflows, and care pathways. A qualitative meta-synthesis was performed on studies published in English between 1995 and 2010 that addressed the modeling process and reported the exposition of a new methodology, model, system implementation, or system architecture. Thematic analysis, principal component analysis (PCA) and data visualisation techniques were used to identify and cluster the underlying implementation ‘challenge’ themes. One hundred and eight relevant studies were selected for review. Twenty-five underlying ‘challenge’ themes were identified. These were clustered into 10 distinct groups, from which a conceptual model of the implementation process was developed. We found that the development of systems supporting individual clinical decisions is evolving toward the implementation of adaptable care pathways on the semantic web, incorporating formal, clinical, and organizational ontologies, and the use of workflow management systems. These architectures now need to be implemented and evaluated on a wider scale within clinical settings

City Research Online

Crossref

PubMed Central

Recommended from our members

Automated recognition and post-coordination of complex clinical terms

Author: Gooch P.
Roudsari A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

One of the key tasks in integrating guideline-based decision support systems with the electronic patient record is the mapping of clinical terms contained in both guidelines and patient notes to a common, controlled terminology. However, a vocabulary of pre-coordinated terms cannot cover every possible variation - clinical terms are often highly compositional and complex. We present a rule-based approach for automated recognition and post-coordination of clinical terms using minimal, morpheme-based thesauri, neoclassical combining forms and part-of-speech analysis. The process integrates MetaMap with the open-source GATE framework

City Research Online

Recommended from our members

The therapeutic use of videogames within secure forensic settings: a review of the literature and application to practice

Author: Gooch P.
Living R.
Publication venue
Publication date: 01/08/2004
Field of study

Engagement in leisure pursuits that involves the use of tools and objects and the exploration of a new environment can provide a success experience that leads to increased feelings of competence and mastery. Such experiences are considered important in the rehabilitation of forensic clients. The findings from videogame research within a general population are compared with those among mental health and forensic clients. Within the general population, videogames may provide opportunities for social interaction and the expression of creativity and humour as well as offering a graded approach to building computer skills. Within a forensic population, videogames have been found to be a normalising, age-appropriate and culturally appropriate activity, useful in engaging clients and improving self-concept and locus of control. The findings suggest that videogame play offers access to a safe virtual environment that encourages exploration and mastery and that it may be a useful therapeutic tool in secure settings where such opportunities are often limited. The use and potential contraindications of videogames within a forensic setting, the content of certain games and their possible influence on behaviour and the implications for future research are also discussed

City Research Online

Recommended from our members

A tool for enhancing MetaMap performance when annotating clinical guideline documents with UMLS concepts

Author: Gooch P.
Roudsari A.
Publication venue
Publication date: 01/01/2011
Field of study

We developed a tool that integrates the National Library of Medicine's MetaMap software with GATE, an open-source text an- alytics framework. The tool allows non-ASCII encoded documents of numerous formats to be annotated with UMLS concepts. We created a GATE pipeline to chunk cardiovascular disease guideline text into default segments (blank-line delimited), XML element content, sentences and phrases, which were sequentially submitted to MetaMap for annotation. XML element, sentence and phrase chunking allowed term extraction and mapping to be completed in around 1/3 of the time taken with de- fault chunking, although with slight loss of accuracy (F1.0s=0.94-0.99). However, phrase chunking allows more complex input to be processed in real time, which is not possible with the other approaches. We discuss the results in relation to use of MetaMap's --term processing option for generating pre- and post-coordinated mappings from composite phrases

City Research Online

Recommended from our members

A modular, open-source information extraction framework for identifying clinical concepts and processes of care in clinical narratives

Author: Gooch P.
Publication venue
Publication date
Field of study

In this thesis, a synthesis is presented of the knowledge models required by clinical informa- tion systems that provide decision support for longitudinal processes of care. Qualitative research techniques and thematic analysis are novelly applied to a systematic review of the literature on the challenges in implementing such systems, leading to the development of an original conceptual framework. The thesis demonstrates how these process-oriented systems make use of a knowledge base derived from workflow models and clinical guidelines, and argues that one of the major barriers to implementation is the need to extract explicit and implicit information from diverse resources in order to construct the knowledge base. Moreover, concepts in both the knowledge base and in the electronic health record (EHR) must be mapped to a common ontological model. However, the majority of clinical guideline information remains in text form, and much of the useful clinical information residing in the EHR resides in the free text fields of progress notes and laboratory reports. In this thesis, it is shown how natural language processing and information extraction techniques provide a means to identify and formalise the knowledge components required by the knowledge base. Original contributions are made in the development of lexico-syntactic patterns and the use of external domain knowledge resources to tackle a variety of information extraction tasks in the clinical domain, such as recognition of clinical concepts, events, temporal relations, term disambiguation and abbreviation expansion. Methods are developed for adapting existing tools and resources in the biomedical domain to the processing of clinical texts, and approaches to improving the scalability of these tools are proposed and evalu- ated. These tools and techniques are then combined in the creation of a novel approach to identifying processes of care in the clinical narrative. It is demonstrated that resolution of coreferential and anaphoric relations as narratively and temporally ordered chains provides a means to extract linked narrative events and processes of care from clinical notes. Coreference performance in discharge summaries and progress notes is largely dependent on correct identification of protagonist chains (patient, clinician, family relation), pronominal resolution, and string matching that takes account of experiencer, temporal, spatial, and anatomical context; whereas for laboratory reports additional, external domain knowledge is required. The types of external knowledge and their effects on system performance are identified and evaluated. Results are compared against existing systems for solving these tasks and are found to improve on them, or to approach the performance of recently reported, state-of-the- art systems. Software artefacts developed in this research have been made available as open-source components within the General Architecture for Text Engineering framework

City Research Online

Greenhouse gas reduction and other benefits of biogas upgrading

Author: Gooch Curt A.
Oliver Jason P.
Publication venue
Publication date: 01/06/2016
Field of study

Biogas obtained from anaerobic digestion of livestock manure is a complex mixture containing ~60% methane (CH4) and other less valuable gases. Upgrading the biogas to reduce contaminants and increase the CH4 concentration is advantageous for several reasons

eCommons@Cornell